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In two-prover one-round interactive proof systems, no-signaling provers are those who are allowed to use 
arbitrary strategies, not limited to local operations, as long as their strategies cannot be used for communica- 
, tion between them. Study of multi-prover interactive proof systems with no-signaling provers is motivated by 

study of those with provers sharing quantum states. The relation between them is that no-signaling strategies 
\ include all the strategies realizable by provers sharing arbitrary entangled quantum states, and more. 

This paper shows that two-prover one-round interactive proof systems with no-signaling provers only 
accept languages in PSPACE. Combined with the protocol for PSPACE by Ito, Kobayashi and Matsumoto 
(CCC 2009), this implies MIP ns (2, 1) = PSPACE, where MIP ns (2, 1) is the class of languages having a 
two-prover one-round interactive proof system with no-signaling provers. This is proved by constructing a 
fast parallel algorithm which approximates within an additive error the maximum value of a two-player one- 
round game achievable by cooperative no-signaling players. The algorithm uses the fast parallel algorithm 
for the mixed packing and covering problem by Young (FOCS 2001). 

\o ' 
m 

(N ; 1 Introduction 

00 

O I 1.1 Background 

On . 

Nonlocality [|Bel641 is a peculiar property of quantum mechanics and has applications to quantum information 
processing. Following Cleve, H0yer, Toner and Watrous [CHTW04], quantum nonlocality can be naturally 
expressed in terms of cooperative two-player one-round game with imperfect information, which is a game 
played by two players and a referee as follows. The players are kept in separate rooms so that they cannot 
communicate with each other. The referee chooses a pair of questions according to some probability distribution, 
and sends one question to each player. Each player replies with an answer to the referee, and the referee declares 
whether the two players jointly win or jointly lose according to the questions and the answers. The players 
know the protocol used by the referee including the probability distribution of the pair of questions and how the 
referee determines the final outcome of the game, but none of the players knows the question sent to the other 
player. The aim of the players is to win the game with as high probability as possible, and the maximum winning 
probability is called the value of the game. In this framework, a Bell inequality is an inequality stating an upper 
bound of the value of a game of this kind when provers are not allowed to perform any quantum operations, 
and the violation of a Bell inequality means that the game value increases when provers are allowed to share a 
quantum state before the game starts. 

The complexity of finding or approximating the value of a game has been one of the most fundamental 
problems in computational complexity theory. The computational model based on cooperative multi-player 
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games is called multi-prover interactive proof systems and were introduced by Ben-Or, Goldwasser, Kilian and 
Wigderson MBGKW88II for a cryptographic purposeQ It turned out that this computational model is extremely 
powerful: multi-prover interactive proof systems exactly characterize NEXP |FRS94~i rBFL91ll , even in the most 
restricted settings with two provers, one round and an exponentially small one-sided error HFL92II . In other 
words, given the description of a cooperative game, approximating the best strategy even in a very weak sense 
is notoriously difficult. These results were built on top of techniques developed in studies on (single-prover) 
interactive proof systems [Bab85, GMR89, LFKN92, |Sha92|| as well as multi-prover interactive proof systems 
with weaker properties I CCL941 |Fei9 1 1 IDS 971 . It is noteworthy that the powerfulness of multi-prover one-round 
interactive proof systems has led to a successful study of probabilistically checkable proof systems [BF LS911 
FGLSS96 ], which play a central role in proving NP-hardness of many approximation problems via the celebrated 
PCP theorem llAS9lllALMSS98l . 

Cleve, H0yer, Toner and Watrous [CHTW04] connected the computational complexity theory and the quan- 
tum nonlocality and raised the question on the complexity of approximating the value of a cooperative game 
with imperfect information in the case where the players are allowed to share quantum states or, in terms of 
interactive proof systems, the computational power of multi-prover interactive proof systems with entangled 
provers. Kobayashi and Matsumoto [KM03] considered another quantum variation of multi-prover interactive 
proof systems where the verifier can also use quantum information and can exchange quantum messages with 
provers, which is a multi-prover analogue of quantum interactive proof systems ITWat031 . In MKM03I1 . it was 
shown that allowing the provers to share at most polynomially many qubits does not increase the power of 
multi-prover interactive proof systems beyond NEXP (even if the verifier is quantum). Although studied in- 
tensively HKKMTV08I ICGJ09I IIKPSY08I IKRT081 IKKMV09I IGut09l IDLTW08I INPA08I IBHP08I IIKM09L the 
power of multi-prover interactive proof systems with provers allowed to share arbitrary quantum states has been 
still largely unknown. 

The notion of no-signaling strategies was first studied in physics in the context of Bell inequalities by Khalfin 
and Tsirelson [KT85] and Rastall [Ras85 ], and it has gained much attention after reintroduced by Popescu 
and Rohrlich [PR94]. The acceptance probability of the optimal no-signaling provers is often useful as an 
upper bound of the acceptance probability of entangled provers (and even commuting-operator provers based on 
the notion of commuting-operator behaviors; see BTsi06[ INPA08I IDLTW08I HKPSY08 because no-signaling 
strategies have a simple mathematical characterization. Toner [Ton09j uses no-signaling provers to give the 
maximum acceptance probability of entangled provers in a certain game. Extreme points of the set of no- 
signaling strategies are also studied [BLMP PR051 IAII061 . 

Kempe, Kobayashi, Matsumoto, Toner and Vidick [KK MTV08I prove, among other results, that every lan- 
guage in PSPACE has a two-prover one-round interactive proof system which has one-sided error 1 — 1/poly 
even if honest provers are unentangled and dishonest provers are allowed to have prior entanglement of any size 
(the proof is in [KKMTV07]). Ito, Kobayashi and Matsumoto [IKM09] improve their result to an exponentially 
small one-sided error by considering no-signaling provers; more specifically, they prove that the soundness of 
the protocol in [KKMTV08] actually holds against arbitrary no-signaling provers, then use the parallel repetition 
theorem for no-signaling provers [Hol09|. We note that the soundness analysis of [IKM09] is somewhat simpler 
than that of BKKMTV08L 

Repeating the protocol of [KKMTV08 ] parallelly as is done in [IKM09 ] results in the protocol identical to 
the one used by Cai, Condon and Lipton MCCL941 to prove that every language in PSPACE has a two-prover 
one-round interactive proof system with an exponentially small one-sided error in the classical world. Therefore, 
an implication of [IKM09] is that the protocol in MCCL941 has an unexpected strong soundness property: the 
protocol remains to have an exponentially small error even if we allow the two provers to behave arbitrarily as 
long as they are no-signaling. 

'Because of this connection, we use "player" and "prover" synonymously in this paper. 
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Given that the soundness analysis of protocols against no-signaling provers is perhaps easier than that against 
entangled provers, it is tempting to try to extend the result of IIK M091 to a class of languages larger than 
PSPACE. For example, is it possible to construct a two-prover one-round interactive proof system for NEXP 
which is sound against no-signaling provers? The answer is no unless EXP = NEXP because two-prover 
one-round interactive proof systems with no-signaling provers can recognize at most EXP as pointed out by 
Preda IfFrel Then what about EXP? 

1.2 Our results 

Let MIP ns (2, 1) be the class of languages having a two-prover one-round interactive proof system with no- 
signaling provers with bounded two-sided error. The abovementioned result in [IKM09] implies MIP ns (2, 1) D 
PSPACE. Preda (Prel shows MIP ns (2, 1) C EXP. 
Our main result is: 

Theorem 1. MIP ns (2, 1) C PSPACE. 

An immediate corollary obtained by combining Theorem Q] with the abovementioned result in [IKM091 is 
the following exact characterization of the class MIP ns (2, 1): 

Corollary 2. MIP ns (2, 1) = PSPACE, and this is achievable with exponentially small one-sided error, even if 
honest provers are restricted to be unentangled. 

This puts the proof system of BCCL941 in a rather special position: while other two-prover one-round in- 
teractive proof systems MBFL91I |Fei91| IFL921 work with the whole NEXP, the one in MCCL941 attains the 
best achievable by two-prover one-round interactive proof systems with two-sided bounded error that are sound 
against no-signaling provers, and at the same time, it achieves an exponentially small one-sided error. 

At a lower level, our result is actually a parallel algorithm to approximately decid^l the value of a two-player 
one-round game for no-signaling players as follows. For a two-player one-round game G, w ns (G) is the value 
of G for no-signaling provers and \G\ is the size of G, both of which will be defined in Section [27TI 

Theorem 3. There exists a parallel algorithm which, given a two-player one-round game G and numbers < 
s < c < 1 such that either w ns (G) < s or w ns (G) > c, decides which is the case. The parallel time of the 
algorithm is polynomial in log|G| and l/(c — s). The total work is polynomial in \G\ and l/(c — s). 

Theorem Q] follows by applying the algorithm of Theorem [3] to the exponential-size game naturally arising 
from a two-prover one-round interactive proof system. This approach is similar to that of the recent striking 
result on the PSPACE upper bound on QIP [JJUW09] as well as other complexity classes related to quantum 
interactive proof systems, i.e. QRG(l) HJW09II and QIP(2) PUW09l Fl 

The construction of the parallel algorithm in Theorem [3] is much simpler than those used in IIJW091 IJUW091 
IJJUW 091 because our task can be formulated as solving a linear program of a certain special form approximately 
instead of a semidefinite program. This allows us to use the fast parallel algorithm for the mixed packing and 
covering problem by Young HYouOlll . 

2 The algorithm stated in Theorem[3]can be converted to an algorithm to approximate w nB (G) within an additive error in a standard 
way. See Remark[2]in Sectional 

3 Do not be confused by an unfortunate inconsistency as for whether the number in the parenthesis represents the number of rounds 
or turns, where one round consists of two turns. The "1" in QRG(l) and the "2" in QIP(2) represent the number of turns whereas the 
"1" in MIP ns (2, 1) represents the number of rounds just in the same way as the "1" in MIP(2, 1). 
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1.3 Organization of the paper 

The rest of this paper is organized as follows. Section [2] gives the definitions used later and states the result 
by Young MYouOlfl about a fast parallel approximation algorithm for the mixed packing and covering problem. 
Section [3] proves Theorem Q] assuming Theorem [3] Section [4] proves Theorem [3] by using Young's fast parallel 
algorithm. Section [5] concludes the paper by discussing some natural open problems. 

2 Preliminaries 

We assume the familiarity with the notion of multi-prover interactive proof systems. Readers are referred to the 
textbook by Goldreich IIGol08ll . 

2.1 Definitions on games 

A protocol of a two-prover one-round interactive proof system defines an exponential-size game for each in- 
stance. Here we give a formal definition of games. 

A two-prover one-round game, or simply a game in this paper, is played by two cooperative provers called 
the prover 1 and the prover 2 with help of a verifier who enforces the rule of the game. A game is formulated 
as G = (Q\, Q2, Ai, A2, 7r, R) by nonempty finite sets Q\, Q2, A\ and A2, a probability distribution tt over 
Qi x Q2, and a function R : Q\ x Q2 x A\ x A2 — > [0,1]. As is customary, we write R(qi, 92, «l> ^2) as 
R(a 1 ,a 2 I 91,92)- 

In this game, the verifier generates a pair of questions (91,92) G Qi X Q2 according to the probability 
distribution n, and sends 91 to the prover 1 and 92 to the prover 2. Each prover v (y G {1,2}) sends an 
answer a v G A v to the verifier without knowing the question sent to the other prover. Finally, the verifier accepts 
with probability R(a%, a2 [ 91, 92) and rejects with probability 1 — R(a\, 02 | 9i, 92)- The provers try to make 
the verifier accept with as high probability as possible. 

The size \G\ of the game G is defined as |G| = [Qijl^H^iH^bl- 

A strategy in a two-prover one-round game G is a family p = (p qiq2 ) of probability distributions on Ai x yl 2 
indexed by (91, 92) G Qi x Q2. As is customary, the probability p qiq2 {ai, 02) is written as p(ai, 02 | 91, 92)- A 
strategy p is said to be no-signaling if it satisfies the following no-signaling conditions: 

• The marginal probability p\{a\ | 91) = ^2 a2 p(oi, «2 I 91, 92) does not depend on 92. 

• The marginal probability ^2(^2 I 92) = J2 ai P( a ii a 2 \ 9i, 92) does not depend on q±. 
The acceptance probability of a strategy p is given by 

^2 nfaite) ^2 R ( a i> a 2 I 9i,92)p(ai,a 2 | 91,92)- 
9ieQi,g2GQ2 016^1,026/12 

The no-signaling value w ns (G) of G is the maximum of the acceptance probability over all no-signaling strate- 
gies. 

2.2 Definitions on interactive proof systems 

Let X = {0, 1}. A two-prover one-round interactive proof system is defined by a polynomial I: Z>o — > Z>o, 
a polynomial-time computable mapping : S* x S* — > S* x S* such that x G S n and r G S^ n ^ imply 
M ff (x,r) G S'( n ) x S ; ( n ), and a polynomial-time decidable predicate M R : S* x S* x S* x S* -» {0, 1}. On 
receiving an input string x G S*, the verifier prepares an l(|x|)-bit string r uniformly at random and computes 
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(<Zi! Q2) = M n (x, r). Then he sends each string q v (y E {1, 2}) to the prover v and receives an /(|x|)-bit string a u 
from each prover v. Finally he accepts if and only if Mr(x, r, 01, a 2 ) = 1. This naturally defines a game 

(QP , > ^ > ^ > 7r(s) > i?(a:) ) for each in P ut strin § ^ wnere Qi ] = Q { 2 ] = A i ] = A 2 ] = S ,(|x|) , 



i? (a;) (ai 



^(91,95) = . #{r G E J(I*I) I M ff (x,r) = (qi,q 2 )}, 

#{r G S'd^D I M^{x,r) = q 2 ) A M R (x, r, ai, a 2 ) = 1} 



«2 I 91, 92 J 



#{reE«M) |M w (x,r) = (gi,g !2 )} 



Let c, s : Z>o — > [0, 1] be functions such that c(n) > s(n) for every n. The two-prover one-round interactive 
proof system is said to recognize a languagJ^ L with completeness acceptance probability at least c(n) and 
soundness error at most s{n) with no-signaling provers when the following conditions are satisfied. 

Completeness x E L =^> w ns (G^) > c(\x\). 

Soundness x L =^ w M (G^) < s(\x\). 

In particular, the proof system is said to recognize L with bounded errors with no-signaling provers if the 
binary representations of c(n) and s(n) are computable in time polynomial in n and there exists a polyno- 
mial / : Z>o — > Z>i such that for every n, it holds c(n) — s(n) > 1/ f(n). We denote by MIP ns (2, 1) the class 
of languages L which are recognized by a two-prover one-round interactive proof system with bounded errors 
with no-signaling provers. 

2.3 Mixed packing and covering problem 

The mixed packing and covering problem is the linear feasibility problem of the form 

Find x E R N , 

Such that Ax < b, 

Cx > d, 
x>0, 

where matrices A, C and vectors b, d are given and the entries of A, b, C, d are all nonnegative. For r > 1, an 
r-approximate solution is a vector x > such that Ax < rb and Cx > d. 

Theorem 4 (Young [YouOl]). There exists a parallel algorithm which, given an instance (A,b,C,d) of the 
mixed packing and covering problem and a number e > 0, either: 

• claims that the given instance does not have a feasible solution, or 

• finds a (1 + e) -approximate solution. 

If the size of A and C are M\ x N and M2 x N, respectively, then the algorithm runs in parallel time polynomial 
in log M%, log Mi, log N and 1 je and total work polynomial in M\, M% N and 1/e. 



4 Although we define MIP ns (2, 1) as a class of languages in this paper to keep the notations simple, we could alternatively define 
MIP ns (2, 1) as the class of promise problems |ESY84, Gol05| recognized by a two-prover one-round interactive proof system with 
no-signaling provers. A generalization of Theorem[T]to the case of promise problems is straightforward. 
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3 Proof of Theorem [I] 



Theorem Q] follows from Theorem [3] by a standard argument using the polynomial equivalence between space 
and parallel time BBor77ll . 

Let L G MIP ns (2, 1), and fix an two-prover one-round interactive proof system which recognizes L with 
bounded errors with no-signaling provers. Let c(n) and s(n) be the completeness acceptance probability and the 
soundness error of this proof system, respectively. We construct a polynomial-space algorithm which recognizes 
L. 

Let x be an input string and n = \x\. Let G^ = (Qi , Q 2 i > A^ ,tt^ x \ R^) be the game naturally 
arising from the proof system on input x. The size of Q 2 . , A± is at most exponential in n. For 

each gi, (?2, 0.1,0.2, it is possible to compute n^ x '(qi, q 2 ) and RS x >(ai,a2 \ qi, 52) in space polynomial in n by 
simulating every choice of randomness of the verifier. By Theorem 4 of Borodin [Bor77 ], the parallel algorithm 
of Theorem |3]can be converted to a sequential algorithm which runs in space polynomial in log|G| and l/(c— s). 
By applying this algorithm to the game G^ x > and the threshold values c(\x\) and s(|x|), we decide whether 
Wns(G^) > c(\x\) or w ns (G^) < s(\x\), or equivalently whether x G L or x ^ L, in space polynomial 
in loglG^I = poly(n) and l/(c(|x|) — s(|xj)) = poly(n). 

Note that the composition of two functions computable in space polynomial in \x\ is also computable in 
space polynomial in \x\, which can be proved in the same way as Proposition 8.2 of [Pa p94[ . 



4 Formulating no-signaling value by mixed packing and covering problem 

This section proves Theorem [3] 

LetG = (Qi,Q 2 ,A 1 ,A 2 ,ir,R) beagame. Letvri(gi) = E g2£ Q 2 I2) and 7T 2 (g 2 ) = T, qi eQi ^fe' «0 
be the marginal distributions. Without loss of generality, we assume that every question in Q\ and Q 2 is used 
with nonzero probability, i.e. vri(gi) > for every q\ G Q\ and 7t 2 (q 2 ) > for every q 2 G Q 2 . 

By definition, the no-signaling value w ns (G) of G is equal to the optimal value of the following linear 
program: 

Maximize vr(gi,g 2 ) R(a 1 ,a 2 \ qi,q 2 )p(a 1 ,a 2 \ qi,q 2 ), (la) 

91,92 ai,a2 

Subjectto ^p(ai,a 2 | qi,q 2 ) =Pi(«i I gi), Vgi G Qi, g 2 G Q2, «l G A 1} (lb) 

<I2 

> ^(01,02 I gi,g2) = p 2 (a 2 [ 0.2), Vgi G Q2, g2 G Q2, a 2 G A 2 , (lc) 

ai 

p(ai,a 2 I gi,g 2 ) = 1, Vgi G Qi, g 2 G Q 2 , (Id) 



0.1,0,2 



p(ai,a 2 I gi,g 2 ) > 0, Vgi G Qi, g 2 G Q 2 , o.\ G A 1; a 2 G A 2 . (le) 

We transform this linear program (Q]) successively without changing the optimal value. First, we replace the 
constraint (fTdl ) by two constraints: 

^2pi(oi \ qi) = 1, Vgi G Qi, (2a) 

ai 

^p 2 (a 2 I g 2 ) = 1, Vg 2 GQ 2 - (2b) 

02 

It is clear that this rewriting does not change the optimal value. 
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Next, we relax the constraints ([Tbl and (flcl) to inequalities: 

y^p(ai,a 2 | gi,g 2 ) <Pl(«i | 9i), Vgi G Qi, g 2 G Q 2 , o x G ,4i, (3a) 

y^p(oi,Q2 I 91,92) <P2(«2 | 92), Vgi G Qi, g 2 G Q 2 , a 2 G A 2 . (3b) 

ai 

Claim 1. r/ze optimal value w of the linear program ([7]) is equal to the maximum value w' of dial) subject to the 
constraints riTel ), ri2al ), rt2&l ), rfjol ) afto" 07il ). 



Proof. Since we only relaxed the constraints, w < w' is obvious. To prove tu > w', let {p,pi,p 2 ) be a solution 
satisfying the constraints (TTel ). (l2ab . (l2bl . (f3ab and (l3bl . We will construct p such that {p,pi,p 2 ) is a feasible 
solution of the linear program £T|) and p(ax, a 2 | q%, q 2 ) > p(ai, o, 2 \ qi,q 2 ) for every qi,q 2 , ai, a 2 . 
Fix any gi, g 2 G Q. Let 

s 9i<? 2 ( a i) = Pi( a i I 9l) - P( a i' fl 2 I 91,92), VaiG^i, 

W( a 2) =P2(a 2 I 92) - X] P(°i' a 2 I 91,92), Va 2 G A 2 . 

ai€Ai 

The following relations are easy to verify: 

s qiq2 (a 1 )>0, VaiGAi, (4a) 
* 9 i 92 (a 2 ) > 0, Va 2 G^ 2 , (4b) 

s 9i92( a i)= J] ^192(02) (=: ^92). (4c) 
a±eAi a 2 GA 2 

We define p(a\, a 2 \ q\, q 2 ) by 



p(ai,a 2 I gi,g 2 ) 



p(ai,a 2 I 91,92) + 7^^2(01)^(02), if ^192 > 0, 



p(ai,a 2 I 91, q 2 ), if F qiq2 = 0. 



Then it is clear from Eqs. d4al ) and d4bl ) that p(ai,o 2 | gi,g 2 ) > jj(ai,a 2 | 91, 9 2 ) for every 91,92,01,02- 

Eqs. flb]) and dTc]) follow from Eq. (©. □ 

Replace the variables p(oi, a 2 | 91, 92) by x(ai, a 2 | 91,92) = vr(gi, g 2 )p(ai, a 2 | 91,92)- The resulting 
linear program is as follows. 

Maximize ^ R(ai, o 2 | 91, g 2 )x(oi, a 2 | 91, q 2 ), (5a) 

Subjectto ^Jx(oi,02 I 91,92) < 7r(9i,9 2 )pi(oi I 91), V91 G Qi, q 2 G Q 2 , a\ G A±, (5b) 

(12 

^Jx(ai,02 I 91,92) < tt(9i, 92)^2(^2 I 92), V91 G Qi, 92 G Q 2 , a 2 G ^4 2 , (5c) 

^Pl(ai I 9i) = 1, V91 G Qi, (5d) 

0.1 

^p 2 (o 2 I 92) = 1, Vg 2 G Q 2 , (5e) 

0.2 

x(ai,a 2 I 91,92) > 0, V91 G Qi, 92 G Q 2 , 01 G ^1, a 2 G ^ 2 . (5f) 



7 



By the strong duality theorem of linear programming, the linear program (f5]) has the same objective value as 
the following: 

Minimize ^ z\ (91 ) + ^ z 2 (92 ) , (6a) 

91 92 

Subject to yi(qi,q 2 ,a 1 ) + y 2 (qx, 92, o 2 ) > R(a x ,a 2 | 91,92), 

V91 G Qi, q 2 G Q 2 , 01 G Al, a 2 G A 2 , (6b) 
^i(gi) > y^7r(9i; 92)2/1(91,92,01), V<?i G Qi, ai e Ai, (6c) 

92 

^2(92) > 53 tt(9i, 92)3/2(91, 92, 02), Vg2 G Q2, «2 G A 2 , (6d) 

2/i(9i, 92, ai) > 0, V91 G Qi, 92 G Q2, ai G Ai, (6e) 

2/2(91,92,02) > 0, V91 G Qi, 92 G Q 2 , a 2 G A 2 . (6f) 

Note that the constraints (I6cl-(l6fl) imply -21(91) > and z 2 {q 2 ) > 0. 

Let (z\, z 2 , 2/1, 2/2) be a feasible solution of the linear program ©. If 2/1(91, 92, 01) > 1 for some 91, q 2 ,a\, 
we can replace 2/1(91, 92, 01) by 1 without violating any constraints or increasing the objective value. The same 
holds for 2/2(91, 92, 02)- Therefore, adding the constraints 

2/i(9i,92,ai) < 1, V91 G Qi, 92 G Q 2 , ai G Ai, 
2/2(91,92,02) < 1, V91 G Qi, 92 G Q 2 , a 2 G A 2 

does not change the optimal value. 

Replacing the variables 2/1(91, 92, ai) by 1 - 2/1(91, 92, ai) and y 2 {qi,q 2 , a 2 ) by 1 - 2/2(91, 92, o 2 ), the fol- 
lowing claim is immediate. 

Claim 2. T/jg no-signaling value w ns (G) is equal to the optimal value of the following linear program. 

Minimize ^ Zi(qi) + ^ z 2 (q 2 ) , (7a) 

91 92 

Subject to 27l(9i,92,oi) + y 2 (q 1 ,q 2 ,a 2 ) < 2 -R(ai,a 2 (91,92), 



^i(9ij + 2^ 7r( .° 1 ' 02 )y i ( 0l '° 2 ' ai ) - 71-1 ( 0l 

92 

2:2(92) + 53 7r ( 01 ' 92 )y 2 (° 1 ' 02 ' a2 ) - ^o? 2 

91 

27l(9l,92,Ol) < 1, 
272(91,92,02) < 1, 

2/l(9i,92,Oi) > 0, 

2/2(91,92,02) > 0, 

21(91) > 0, 
^2(92) > 0, 

Lemma 5. Le/ G = (Qi, Q 2 , A\, A 2 , ir, R) be a game and < s < c < 1. Consider the instance of the mixed 
packing and covering problem consisting of a constraint ^ z\ (91 ) + z 2 (q 2 ) < s and the constraints (17 b\)— 
(f^l). Let e = (c - s)/4 77ie«, 



V91 


G Qi, 


92 G Q 2 , 


ai G Ai, a 2 G A 2 , 


(7b) 


V91 


G Qi, 


01 G Ai, 




(7c) 


V92 


G Q 2 , 


02 G A 2 , 




(7d) 


V91 


G Qi, 


92 G Q 2 , 


01 G Ai, 


(7e) 


V91 


G Qi, 


92 G Q 2 , 


02 G A 2 , 


(7f) 


V91 


G Ql, 


92 G Q 2 , 


01 G A x , 


(Vg) 


V91 


G Qi, 


92 G Q 2 , 


02 G A 2 , 


(7h) 


V91 


G Qi, 






(Vi) 


V92 


gQ 2 - 






(7j) 
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(i) Ifw QS (G) < s, this instance has a feasible solution. 

(ii) Ifw ns (G) > c, this instance does not have a (1 + e)-approximate solution. 
Proof. (i) Clear from Claim |2] 

(ii) We prove the contrapositive. Assume that (f/i, 2/2, z±, z 2 ) is a (1 + e)-approximate solution, and let 

27l(9i, 92, «i) = Y^27l(9i,92,ai), 

2/2(91,92,^2) = ^-^2/2(91,92,02), 

4 (9i) = ^l(9l) +£7Tl(9i), 
4(92) = ^2(92) +£^2(^2)- 

Then (y[,y 2 , z[, z 2 ) satisfies (l7bl)-(f7jT>> and 

^Aili) + ^^(92) = ^Ztfa) + e^vr 1 (g 1 ) + ^^(92) +£^^2(92) < s + 3e < c. 

91 92 91 9i 92 92 

Therefore, w ns (G), or the optimal value of the linear program (Q, is less than c. □ 

Proof of Theorem\3\ Apply Theorem[4]to the instance of the mixed packing and covering problem in Lemma[5] 

□ 

Remark 1. It is easy to see that adding the constraints Zi(qi) < iri(qi) for q\ E Q\ and 22(92) < ^2(92) 
for q 2 € Q2 to the instance of the mixed packing and covering problem in Lemma [5] does not change the 
feasibility or approximate feasibility. The resulting linear program has a constant "width" in the sense stated in 
Theorem 2.12 of Plotkin, Shmoys and Tardos [PST95 ] with a suitable tolerance vector. See [PST95 ] for relevant 
definitions. This gives an alternative proof of Theorem [3] which uses the algorithm of [PST95 ] instead of the 
algorithm of HYouOII . 

Remark 2. Given Theorem |3l it is easy to approximate w ns (G) within additive error e (rather than deciding 
whether w ns (G) < s or w DS (G) > c) in parallel time polynomial in log|G| and 1/e and total work polynomial 
in |G| and 1/e. This can be done by trying all the possibilities of s = ke and c = (k + l)e for integers k in the 
range 0<fc<l/ein parallel, or by using the binary search. 



5 Concluding remarks 

This paper gave the exact characterization of the simplest case of multi-prover interactive proof systems with 
no-signaling provers: MIP ns (2, 1) = PSPACE. A natural direction seems to be to extend this result to show a 
PSPACE upper bound on a class containing MIP ns (2, 1). Below we discuss some hurdles in doing so. 

• More than two provers. In the completely classical case, a many-prover one-round interactive proof sys- 
tem can be transformed to a two-prover one-round interactive proof system by using the oracularization 
technique, and therefore MIP(poly, 1) C MIP(2, 1). The same transformation is not known to preserve 
soundness in the case of no-signaling provers even when the original proof system uses three proversH 

5 The Magic Square game in |CHTW04| is a counterexample which shows that this transformation cannot be used alone to reduce 
the number of provers from three to two in the case of entangled provers because it sometimes transforms a three-prover game whose 
entangled value is less than 1 to a two-prover game whose entangled value is equal to 1 |IKPSY08|. The situation might be different in 
the case of no-signaling provers. 
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As a result, whether or not MIP ns (3, 1) C MIP ns (2,l) is unknown, and our result does not imply 
MIP ns (3,l) C PSPACE. 

To extend the current proof to MIP ns (3, 1), the main obstacle is to extend Claim [T] which replaces equa- 
tions by inequalities. It does not seem that an analogous claim can be proved for three provers by a 
straightforward extension of the current proof of Claim Q] 

• More than one round. The proof of Claim[T]seems to work in the case of two-prover systems with polyno- 
mially many rounds. However, in a linear program corresponding to ©, an upper bound on the variables 
yi and 1/2 becomes exponentially large and the current proof does not work even in the case of two-prover 
two-round systems with adaptive questions or two-prover a; (log n) -round systems with non-adaptive ques- 
tions. 

• Quantum verifier and quantum messages. The notion of no-signaling strategies can be extended to the case 
of quantum messages RBGNPOU IGut091 ( 1BGNP0U uses the term "causal" instead of "no-signaling"). 
This allows us to define e.g. the class QMIP ns (2,2) of languages having a quantum two-prover one- 
round (two-turn) interactive proof system with no-signaling provers. The class QMIP ns (2, 2) contains 
both MIP ns (2, 1) and QIP(2), and it would be nice if the method of HJUW09H and ours can be unified 
to give QMIP ns (2, 2) = PSPACE. One obvious obstacle is how to extend the fast parallel algorithm in 
[JUW09] for the special case of semidefinite programming to the case of QMIP ns (2, 2). Another obstacle 
is again Claim Q} the current proof of Claim Q] essentially constructs of a joint probability distribution 
over (qi, <72, «i, 02) from its marginal distributions over ((ft, #2, 0,1) and (qi,q2, 02), and this kind of state 
extension is not always possible in the quantum case [Wer89, Wer90]. 
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